









wl I), & 
“SS! «*® "Ang secsaionaos 
—— 4, e 
=f OO: ENG Transactions 
Vy Journal homepage: http://www.engtransactions.com ae 





ENG Transactions, vol. 2, pp. 1-12, June 2021 
ID: ENGT-21061221 12348 


Research Article 


Prediction for Monthly Rainfall of Six Meteorological Regions and TRNC (Case 
Study: North Cyprus) 


Mir Mohammad Ali Malakoutian'*, Yas Malakoutian”, Pooria Mostafapour’, Sina Ziafat Dost Abed* 


'Department of Architecture and the Built Environment, University of the West of England, Bristol, United Kingdom 
2Department of Art and Architecture, University of Guilan, Rasht, Iran 

3Department of Business, The University of Texas at Arlington, Arlington, USA 

#Department of Civil engineering, Islamic Azad University, Arak Branch , Arak, Iran 


Keywords Abstract 


Forecasted data, This paper works with the monthly rainfall of six meteorological regions and TRNC (North 


predict Cyprus) as a whole for the hydrologic years from September 1975 to August 2014 period. 


rainfall In order to predict 5 years ahead of the yearly rainfall of each meteorological region and 


; TRNC, three different time series models (Markov, Auto-regressive (AR) and Holt-Winter 
time series models, 


TRNC Multiplicative) were used. For this reason, the rainfall of hydrologic years from 1975-76 to 


2003-04 were used for training and from 2004-05 to 2013-14 were used for forecasting 
(testing) the trained data. The best representative time-series model for each region was 
selected based on the standardized averages of four statistical error checking measures 
(MAPE, MAD, MSE and RMSE). The selected model for each region was then used to 
predict (estimate) the rainfall for five successive hydrologic years ahead from 2014-15 to 


2018-19. 


1. Introduction 


1.1 Study Area 
The study area is North Cyprus which is divided to 6 regions (Central Mesaria,East Coast, East Mesaria, Karpaz, North Coast, 
West Mesaria, and TRNC as general). 


1.2 Time series 


A collection of organized observations of a quantitative variable taken at successive points in time is called a time series. Time 
in terms of years, months, days, or hours is a tool that permits one to connect occurrence to a set of common, stable reference 


points [1]. 
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Time series forecasting is the use of a model to predict future values based on previously observed values.There are two main 
goals of time series analysis: (a) identifying the nature of the phenomenon represented by the sequence of observations, and (b) 
forecasting (predicting future values of the time series variable). Both of these goals require that the pattern of observed time 


series data is identified and more or less formally described. 


1.3 Stationary Time Series 


A time series is stationary if it is free of trends, shifts, or periodicity. It means that the statistical parameters of the series such 
as mean and variance remain constant through 62 time. Otherwise the time series is non-stationary. Generally hydrologic time 
series defined on an annual time scale are stationary [2]. The stationary test should be done to determine if the mean values and 
variances of the series vary with time. Other recent suggested methodes can be found in [3-8]. Stationary time series implies that 
none of the data varies with time. In this study the Augmented Dickey — Fuller (ADF) test is used to check Stationarity of times 


series: 


1.4 Augmented Dickey—Fuller Unit Root Test 


Augmented Dickey—Fuller unit root test (ADF) as its name refers is a test for a unit root in any time series. Being augmented 
implies larger and more complicated set of time series models. 
Ayt =a+ Bt + yyt-1 + d1Ayt—-1 + +++ dp—IAyt-—p-—I + et (1) 
where @ is a constant, B the coefficient on a time trend and p is the lag order of the autoregressive process. 


By including lags of the order p the ADF formulation allows for higher-order autoregressive processes. This means that the 
lag length p has to be determined when applying the test. The unit root test is then carried out under the null hypothesis y > 0 
against the alternative hypothesis of y < 0. In this study Autoregressive order | (AR1) is used, therefore: 


Ayt=a+ Bt + yyt-I + et (2) 
HO: y = 0 implies that the data is stationary, H1: y < 0 implies that the data is non-stationary. 
Result : Central Mesaria rainfall (time series) is stationary. For other regions also this test was applied: 


Table1. ADF test result of all regions 


Region y= Slope of regression comment 
Central Mesaria 0.43>0 Time series is Stationary 
East Coast 0.28>0 Time series is Stationary 
East Mesaria 0.34>0 Time series is Stationary 
Karpaz 0.26>0 Time series is Stationary 
North Coast 0.27>0 Time series is Stationary 
West Mesaria 0.08>0 Time series is Stationary 
TRNC 0.33>0 Time series is Stationary 


2. Methodology 


Forecasting model involves the selection of an estimation procedure. A forecast after all, is an estimate of a future outcome 


of a random process. In this study three forecasting models were used: 
2.1 Markov Model 


The Markov pocess considers that the value of an event at one time is correlated with the value of the event at an earlier 


period. In a first-order Markov process, this correlation exists in two consecutive values of the proceedings. The first order 
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Markov model, which comprises the classic method in synthetic hydrology, declares that the value of a variable y in one time 


period is dependent on the value of ‘y’ in the preceding time period plus a random component. 
yi = di + Ej (3) 


where y;= rainfall at i" year, di= deterministic part of i year, and ¢; = random part of i" year. 


The values of ¢| are connected with the historical data by certifying that they belong to the same frequency distribution and 
possess similar statistical properties (mean, deviation, skewness) as the historical series [9].A variety of forms and combinations 
of deterministic and random component are established as different models. Single season (annual) rainfall model of lag | is the 
simplest model which presumes that the amount of the current rainfall 1s considerably correlated with the previous one value 
only [9]. 

If the Markov model’s parameters are estimated from data, the standard maximum likelihood estimates consider the first 
order (single step) transitions only. But for many problems, the first order conditional independence assumptions are not satisfied 


as a result of the higher order transition probabilities can be poorly approximated by the learned model [14]. 


Formulation of the Markov model for yearly data [9]: 
x =X 4+7(%j-1-X) +SVA- 77); (4) 


where x; is the rainfall at i" year, x is the mean of data, r,is lag one — autocorrelation coefficient , S is standard deviation of 
data, and t; is random variate from an appropriate distribution with a mean of zero and variance of unity. For obtaining ¢; the 
random number should be generated, and it did by Microsoft Excel in this study, as well inverse error function erf~*(z)should 
be calculated: 


~1 me! BE oe he Te oe ETE oy ) 
erf (2) =2Vn(z+=z Poe hae (5) 


Value of z can be obtained from cumulative distribution function (CDF) of the log normal distribution. 


If erf(x) = y, then erf~*(y) = x. Let 


Inx-1 = 





The value of t = Inx. Therefore 


lInx = (yv2) +1 (7) 


2.1 East Mesaria region rainfall as a sample for establishing Markov Model 


In order to establish the Markov model for East Mesaria rainfall the mean, autocorrelation coefficient (rl), standard deviation, 


random number, and other relevant parameters were determined automatically with help of Excel software and the results were 


tabulated below. 
Table 2. Markov Model of Central Mesaria rainfall 
Year ppt mean rl standard rand Z erf-1 ti] xi(F forecasting) 
deviation 
1975-1976 355.4 324.2 0.12 94.8 0.999 0.998 1.327 2.877 556.0 
1976-1977 260.2 324.2 0.12 94.8 0.969 0.938 1.170 2.655 601.9 
1977-1978 345.8 324.2 0.12 94.8 0.985 0.969 1.249 2.766 617.8 
1978-1979 292.3 324.2 0.12 94.8 0.800 0.601 0.595 1.841 532.7 
1979-1980 354.0 324.2 0.12 94.8 0.782 0.564 0.550 1.778 516.5 
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2010-2011 345.0 324.2 0.12 94.8 0.214 -0.571 -0.559 0.209 352.4 
2011-2012 466.3 324.2 0.12 94.8 0.116 -0.769 -0.834 -0.180 310.7 
2012-2013 366.1 BZA 2 0.12 94.8 0.365 -0.270 -0.244 0.655 384.2 
2013-2014 196.2 324.2 0.12 94.8 0.035 -0.931 -1.153 -0.631 272.0 


2.2 Auto-regressive (AR) Model 
General expression of Auto-regressive model can be defined by: 
Ve = PrYe-1 + P2Vt-2 t+ + PyVt-p + Et (8) 


where (1, P2, +. Pp are autoregressive coefficients, and €, is white noise (residuals) and p is the order of Auto-regressive 


model.Autoregressive coefficients 6, can be computed with below matrix form as given in following equation: 


1 % % % 1% Tp-1 

r, 1 % %% Pe Ty-2 

me % 2 Wi Ge mans (8a) 
Tp-1 'p-2 'p-3 Tp-4 'p-5 ee ee 1 


where r is correlation that is a measure of the relation between two or more variables. Correlation coefficients can range from 


-1.00 to +1.00. The autocorrelation function can be defined as: 
—k Xi- HW) Xise- 
r= yn eet i+ 9 
k= Diet wy (xm)? O) 
In this study Minitab16 software was used for finding correlations. Before applying autoregressive model to time series, 
order of Autoregressive model (P) should be defined. In this study Akaike information Criterion is used to find the best order of 
Autoregressive between AR(1), AR(2), AR(3) in order to show best Autoregressive model for deriving suitable synthetic series 


of data. After modelling series the time series is forecasted by the best Autoregressive model based on the AIC number [2][10]. 


2.3 Akaike Information Criterion 


The Akaike information criterion (AIC) is a measure of the relative quality of standard models or a given set of data. AIC 
estimates the quality of each model, relative to each of the other models. Hence, AIC provides a means for model selection [11]. 


Akaike recommends the following relationship for Autoregressive model: 


AIC(p) = min(n. Ino,” + 2(p)) (10) 


where n is the sample size, o,7 is the maximum likelihood estimate of the residual variance, p is the order of autoregressive 


model.The model, which gives the minimum AIC number, is the one to be selected. 


2.4 Steps in Calculating AIC number for Central Mesaria Rainfall 


e From the measured and hydrologic yearly averaged rainfall, from 1975-76 to 
2003-2004 were used to establish AIC numbers. Note that averaged values’ from 
hydrologic years 2004-05 to 2013-14 data will be used to check the prediction. 

e Auto-correlation coefficients (rk) is calculated using Minitab 16. 

e Auto-regressive coefficients of three different orders (AR(1), AR(2), and AR(3)) 
are used in this. study and their coefficients by solving the matrix given in _ Equation 


8a were determined through Excel software. 
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e First order autoregressive model (AR1) equation is : 


y= 0.39 Vi-1 + or 


e Second order autoregressive model (AR2) equation is : 


ed 0.43 Vea 0.1 Vi-2 + Ej 


e Third order autoregressive model (AR3) equation is : 


Vi = 0.4 Vi-1 + 0.01 0.28 Vi-3 + cj 


e for ARI: 


y,? = (0.39 y;_, + €;)? 12 = 0.397 * y,_17 +2 * 0.39 * y;_, * 0, + 0,7 
6° S.0:372 


e For AR2: o,* = 0.8 

e For AR3: o,* = 0.77 

e AIC numbers are: 
For ARI: AIC=29*In(0.372)+2(1) = -26.68 
For AR2: AIC=29*1n(0.8)+2(2)= -2.47 
For AR3: AIC=29*In(0.77)+2(3)= -1.58 


Table 3. Autocorrelation Coefficients of Central Mesaria rainfall data 


Lag (K) Autocorrelation Coefficients (rx) 
1 0.39 
2 0.05 
3 -0.26 
4 -0.14 
3) -0.03 
6 -0.08 
7 -0.02 
8 0.05 
9 0.18 
10 -0.12 
11 -0.21 
12 -0.32 
13 -0.07 
14 0.10 
15 0.06 
16 0.11 
17 0.03 
18 -0.03 
19 -0.14 

20 -0.21 
21 -0.12 
22 0.04 
23 0.11 
24 0.02 
25 0.01 
26 0.00 
21 0.05 
28 0.02 


e The minimum AIC number which is AR(1) model for this region is selected. 
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Table 4. AIC number of regions and TRNC 


region Akaike information criteria(AIC) 
AR(1) AR(2) AR(3) 
Central Mesaria -26.68* -2.47 -1.58 
East Coast 67.26 3.12 5312, 
East Mesaria -6.34 2.82 4.21 
Karpaz -5.21 2.82 4.82 
North coast -3.06 2.82 4.82 
West Mesaria 0.82 4.00 6.00 
TRNC -5.21 2.82 -107.45 


* The bold numbers are representing the relevant AR model of that region 1.e being the smallest value among each row. 


2.5 Derivation of the synthetic sequence 


After obtaining the suitable model from Akaike information criteria, the derivation of synthetic sequences were obtained by 
finding the values of normal and independent residual ¢; [10]. The residual can be defined as if obeys normal distribution by: 
Ej = Ue + O,L; (15) 
where sé is the mean of residuals, o, is the standard deviation of residuals, Z7;,, Z2; represent the standard normal random 
numbers that must be calculated by using the uniform random numbers, 7; varying randomly between O and 1. The standard 
normal random numbers are defined by: 


Z; = (—2lnn,;)'/?cos(2102;) (16) 
Zi = (—2lnn,;)'/’sin(2172;) (17) 
2.6 Derivation of AR(1) Model for Central Mesaria 


Through AR(1) model, applying Eq. (15), (16), and (17) to hydrologic yearly averaged measured rainfall values of Central 
Mesaria regions, their relevant synthetic values were generated and tabulated for the period of hydrologic years 1975-76 to 2003- 


04 below. 


Table 5. Synthetic data generated by AR(1) model for Central Mesaria 


uniform random Standard normal Residual for yivalues for yiSynthetic values for 


number random number AR(1) model AR(1) model AR(1) 
n1 0.31 Zi -0.38 C1 -0.23 0 yl 302.95 
n2 0.71 Z2 -1.48 €2 -0.90 -0.90 y2 237.16 
n3 0.80 Z3 -0.60 €3 -0.36 -0.71 y3 250.87 
n4 0.43 ZA 0.29 C4 0.18 -0.10 y4 295.12 
15 0.83 Z5 -0.39 €5 -0.24 -0.28 y5 282.62 
126 0.22 Zr 0.37 €26 0.23 0.24 26 320.16 
127 0.29. Zaz -1.58 €27 -0.96 -0.87 y27 239.50 
128 0.49 Z28 0.06 €28 0.04 -0.30 y28 281.24 
N29 0.44 Z29 -1.23 €29 -0.75 -0.87 y29 239.71 


Figure 1 shows the fluctuations of the synthetically generated AR(1) data from the measured hydrological yearly averaged 
rainfall values from 1975-75 to 2003-04. 
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Figure 1. Synthetically generated AR(1) model and the respective measured rainfalls of Central Mesaria for the period of 
hydrologic years 1975-76 to 2003- 04 


2.7 Forecasting hydrologic yearly averaged rainfall of Central Mesaria through AR(1) Model: 


Based on the above discussed methodology, AR(1) model was used to generate the forecasted hydrologic yearly average 


rainfall of Central Mesaria for the period of to 2004-05 to 2013-14 years. Details are given in Table 6. 


Table 6. Forecasted rainfall by AR(1) for central mesaria for the period of hydrologic years 2004-05 to 2013-14. 


uniform random Standard normalrandom — Residual for AR(1) yi values for Forecasted synthetic data 
number number model AR(1) model Yi 
130 0.11 Z30 0.76 £30 0.69 0.87 y30 366.1 
N31 0.30 Z31 -1.27 C31 -1.16 -0.82 y3l 243.0 
132 0.60 Z32 -0.87 €32 -0.79 -1.11 y32 221.8 
33 0.04 233 -0.46 €33 -0.42 -0.85 y33 241.2 
134 0.28 234 2.47 €34 2.24 1.92 y34 442.8 
35 0.10 235 2.16 €35 1.97 pag al y35 500.5 
136 0.01 236 0.14 €36 0.13 1.18 36 388.7 
37 0.83 237 0.03 €37 0.02 0.48 y37 337.9 
138 0.76 Z38 -0.60 €38 -0.55 -0.36 y38 276.4 
39 0.30 Z39 1.55 €39 1.41 heed y39 395.4 


The graphical representation of this forecasted data set are shown in Figure 2. 
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2.8 Holt-Winter method 


The Holt-Winters seasonal method includes the forecast equation and three smoothing equations , one for the level 1,, one for 
trend b,, and one for the seasonal component denoted by S;, with smoothing parameters a, 6 and y. There are two variations to 
this method that differ in the nature of the seasonal component. The additive method is preferred when the seasonal variations 
are roughly constant through the series, while the multiplicative method is preferred [12-14]. In this study Holt-Winter 


Multiplicative Model is adopted using Minitab 1 6software. 


2.9 Holt-Winter Multiplicative method 





Fz = (lp + hdy) Seman (20) 
=a (PE) + (1 a) (lea + bea) (20a) 
b, = BCL — Le_-1) + 1 — BYDe-1 (20b) 
SS VG +(1—y)Stom (20c) 


and the error correction form of the smoothing equations is: 





l, = L4 + De-4 + ae (21) 
t—m 
b, = be_-1 + 1 (21a) 
t-—m 

a et 

eee ee (l¢-1+bt-1) (21b) 
where: 

Cp = Ve — Cea + Dea) Stem = Vi — Vt (21c) 


2.10 Accuracy Measures of the Forecasted Models 


when selecting a forecasting model, or when evaluating an existing model, one has to use measures that summarise the overall 
accuracy provided by that model [15-19]. Beside the visual comparison, the suggested models performances are also evaluated 
using the below mentioned statistical accuracy measures: 


e The mean absolute deviation (MAD), 


1 x 
MAD = ~ bt=11Ve — J, (22a) 
e The mean square error (MSE), 


1 $5 
MSE = ~ Lt=1 Ve = d.)° (22b) 


e The root mean square error (RMSE), and 


E & 
RMSE = ~ t=1 Ve = 9)" (22c) 


e The mean absolute percentage error (MAPE) . 


MAPE =-Yr., * 100 (22d) 





ve-St 
Yt 





3. Results and Discussions 
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The time series models with forecasting values was used to understand the occurrence of the random mechanism so as to 
predict future series based on the past data. For time series model studies, the gathered rainfall values were initially divided into 
two time series parts, 

1. from September 1975 to August 2003, and 
2. from September 2004 to August 2014. 


The data within the first part was used to train the model and the data set of the second part was used for comparing the error 
accuracy of the suggested models based on the previously trained data [20][21]. Once the best fitted model was obtained within 
the acceptable confidence limits, since the sample size is comparably small (n=39), only 5 years ahead values were predicted 


based on the most representative model, 1.e., up to 2018- 19. 


3.1. Forecasted values by time series models of Central Mesaria rainfall for the hydrologic years period 2003-04 to 2013-14 


3.1.1. Markov Model 
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Figure 3. Graphical comparison of Markov model (trained and forecasted) and the hydrologic yearly averaged rainfall (measured) of Central 
Mesaria 


3.1.2. Auto-Regressive (AR) Model 
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Figure 4. Graphical comparison of AR(1) model (trained and forecasted) and the hydrologic yearly averaged rainfall (measured) of Central 
Mesaria 
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3.1.3. Holt-Winter Multiplicative Model 
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Figure 5. Graphical comparison of Holt-Winter Multipilicative method model (trained and forecasted) and the hydrologic yearly averaged 


rainfall (measured) of Central Mesaria 


Result : AR(1)model is the best model among the others, having the lowest overall error ratio based on the given 4 


standardized error measures. Hence, for Central Mesaria Region, AR(1) model is used to generate (predict) the rainfall for the 


hydrologic years 2014-15 to 2018-19 which are tabulated below. 


Table 7. Expected yearly rainfall of Central Mesaria region based on AR(1) model for hydrologic years 2014-15 to 2018- 19 


rainfall (mm) 


1975-1976 


1977-1978 


Hydrologic Years 
2014-15 
2015-16 
2016-17 
2017-18 
2018-19 


Measured rainfall 
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ome FOrecasted 
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1985-1986 | 


1981-1982 _ 
1987-1988 | 











1989-1990 | 


1991-1992 


Expected yearly rainfall (mm) 








1993-1994 


269.4 
244.6 
238.0 
2179 
219.6 











hydrologic year 


2001-2002 


Trained 





2003-2004 





Predicted 








Figure 6. Expected (predicted) yearly rainfall of Central Mesaria region based on AR(1) model for hydrologic years 2014-15 to 2018-19 


The mentioned model were applied on other 5 regions and TRNC( general) regions . The synopsis of the time series models 


of each meteorological region and TRNC as a whole are tabulated below. 
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Table 8. Synopsis of the rainfall data parameters and time series models of each meteorological region and TRNC as a whole 


Region Expected yearly rainfall / Bellow () or above () long years rainfall means 
iol) 


5 8 

x -Representive time series model ( best model) 2014-15 2015-16 2016-17 2017-18 2018-19 
Central Mesaria299.9 AR(1) 2094 | 2446 | 2380 |. 2175 #\J| 2196 | 
East Coast 334.7 Holt-Winter Mult. 3045 | 4415 7 353.1 fT 258.8 | 2746 | 
East Mesaria 334.2 AR(1) 318.4 | 3384 7 £44203 7 4=%3049 | 3305 | 
Karpaz 449.7 Holt-Winter Mult. 413.9 | 601.2 ff 44788 ff 372.7 | 3569 | 
North Coast 461.8 AR(1) 553.0 f 325.8 | 371.3 \)| 5696 fT 4469.8 f 
West Mesaria 310.8 Holt-Winter Mult. 383.0 ft 3903 7 400.1 ff 322.0 fT 387.2 fF 
TRNC (general)378.8 AR(3) 281.2 | 355.7 | 4024 7 3575 | 3236 | 


4. Conclusion 


Rainfall of the 6 meteorological regions of North Cyprus and TRNC as a whole were analyzed by three widely used time 
series models (Markov, Auto-Regressive, and Holt-Winter Multiplicative) and five successive years of predicting data sets (from 
hydrologic years 2014-15 to 2018-19) were generated. For this purpose, to determine the best time series model for each regions 
and for TRNC, the standardized MSE, MAPE, RMSE, and MAD were used. 

Based on the best representative time series predictions of the five successive years, except North Coast and West Mesaria 
for hydrologic year 2014-15 will experience lower than the long yearly average rainfall value implying dryness whereas, TRNC 
will experience higher than its long yearly average during the hydrologic year 2016-17 implying wet period. The other relevant 
details are given in the following table. It is worth to express that for the coming five hydrologic years (2014-15 to 2018-19) 
West Mesaria will experience total rainfall more than its long years average for all those years whereas Central Mesaria will 


experience total rainfall less than its long years average for all those years. 
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